Abstract

A Markov basis for a statistical model of contingency tables is a useful tool for performing
conditional tests of the model via the Markov chain Monte Carlo method. In this paper we
derive explicit forms of Markov bases for change point models and block diagonal effect
models, which are typical block-wise effect models of two-way contingency tables, and perform
conditional tests with some real data sets.

Keywords and phrases: block diagonal effect model, change point model, Markov chain Monte

1 Introduction

Goodness-of-fit tests for statistical models of contingency tables are usually performed using the
large sample approximation to the null distribution of the test statistic. However, as shown in
Haberman [?], the large sample approximation may not be appropriate when the expected
frequencies are not large enough. In such cases it is desirable to use a conditional testing
procedure. In this paper we discuss conditional testing via the Markov chain Monte Carlo (MCMC)
method with Markov bases.

Diaconis and Sturmfels [?] showed the equivalence of a Markov basis and a binomial generator
for the toric ideal arising from a statistical model of discrete exponential families and developed
an algebraic sampling method for conditional distributions. Thanks to their algorithm, once
we have a Markov basis for a given statistical model, we can perform a conditional test of the
model via the MCMC method. However, the structure of Markov bases is complicated in general.
Many researchers have studied the structure of Markov bases in algebraic statistics (e.g. Dobra
and Sullivant [?], Aoki and Takemura [?], Rapallo [3], Hara et al. [?]).

In this paper we derive Markov bases for some statistical models of two-way contingency
tables that take the effects of subtables into account. It is well known that the set of square-free
moves of degree two (basic moves) forms the minimal Markov basis for the complete independence
model of two-way contingency tables. On the other hand, when a subtable effect is added to the
model, the set of basic moves does not necessarily form a Markov basis. This problem is called the
two-way subtable sum problem (Hara et al. [8]). Previous research considered statistical models
with one subtable effect. We consider some statistical models including several
subtable effects.

The organization of this paper is as follows. In Section 2 we summarize the notation and
definitions on Markov bases and introduce two-way subtable sum problems. In Section 3 we
derive the minimal Markov basis for the configuration arising from the two-way change point model
and discuss the algebraic properties of the toric ideal arising from the change point model.
Sections 4 and 5 give the explicit forms of Markov bases for the configurations arising from some
block diagonal effect models. In Section 6 we apply the MCMC method with our Markov bases
to some data sets and confirm that it works well in practice. We conclude the paper with some
remarks in Section 7.

2 Two-way subtable sum problem 

In this section we summarize notations and definitions on Markov bases and give a brief review 
of two-way subtable sum problems. 

2.1 Preliminaries 

Let $\mathbb{N} = \{0, 1, 2, \dots\}$ and let $x = \{x_{ij}\}$, $x_{ij} \in \mathbb{N}$, $i = 1, \dots, R$, $j = 1, \dots, C$, be an $R \times C$ two-way
contingency table with nonnegative integer entries. Let $\mathcal{I} = \{(i,j) \mid 1 \le i \le R,\ 1 \le j \le C\}$ be
the set of cells. We order the elements of the contingency table $x$ lexicographically and regard $x$ as
a column vector.

Let $A$ be a $T \times |\mathcal{I}|$ zero-one matrix where $T$ is a positive integer and $|\mathcal{I}| = RC$. We assume
that the subspace of $\mathbb{R}^{|\mathcal{I}|}$ spanned by the rows of $A$ contains the $|\mathcal{I}|$-dimensional row vector
$(1, \dots, 1)$. For a given $t \in \mathbb{N}^T$ the set of contingency tables

$$\mathcal{F}_t = \{x \in \mathbb{N}^{|\mathcal{I}|} \mid Ax = t\}$$

is called the $t$-fiber. An integer table $z$ with $Az = 0$ is called a move for $A$. Define the degree of a
move $z$ as $\|z\|_1/2 = \sum_{(i,j) \in \mathcal{I}} |z_{ij}|/2$. Let $\mathcal{M}_A = \{z \mid Az = 0\}$ denote the set of moves for $A$. A
subset $\mathcal{B} \subset \mathcal{M}_A$ is called sign-invariant if $z \in \mathcal{B}$ implies $-z \in \mathcal{B}$. A Markov basis for $A$ is defined
as follows.

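Definition 1. A sign-invariant finite set of moves $\mathcal{B} \subset \mathcal{M}_A$ is a Markov basis for $A$, if for any
$t$ and $x, y \in \mathcal{F}_t$ $(x \neq y)$ there exist $U > 0$ and $z_{v_1}, \dots, z_{v_U} \in \mathcal{B}$ such that

$$y = x + \sum_{s=1}^{U} z_{v_s} \quad \text{and} \quad x + \sum_{s=1}^{u} z_{v_s} \in \mathcal{F}_t \ \text{ for } 1 \le u \le U.$$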

In this paper we only consider sign-invariant sets of moves as Markov bases. Since the row
vector $(1, \dots, 1)$ is contained in the subspace of $\mathbb{R}^{|\mathcal{I}|}$ spanned by the rows of $A$,
$\sum_{(i,j):\, z_{ij} > 0} z_{ij} = -\sum_{(i,j):\, z_{ij} < 0} z_{ij}$ holds for every move $z \in \mathcal{M}_A$. A move $z$ can be written as $z = z^+ - z^-$
where $z^+ = \{\max(z_{ij}, 0)\}$ and $z^- = \{\max(-z_{ij}, 0)\}$. If there exists a fiber $\mathcal{F}_t = \{z^+, z^-\}$, we
say that $z$ is an indispensable move.
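The definitions above can be checked by hand on very small tables. The following is a minimal Python sketch, not part of the original paper: the helper names `parts`, `degree` and `fiber` are hypothetical, and the fiber is enumerated by brute force, which is only feasible for tiny examples. It takes the independence model on a $2 \times 2$ table and verifies that the basic move is indispensable in the sense just defined.

```python
from itertools import product

def parts(z):
    """Split a move z into its positive part z+ and negative part z-."""
    z_plus = [max(v, 0) for v in z]
    z_minus = [max(-v, 0) for v in z]
    return z_plus, z_minus

def degree(z):
    """Degree of a move: half of its l1-norm."""
    return sum(abs(v) for v in z) // 2

def fiber(A, t, bound):
    """Brute-force enumeration of {x >= 0 : A x = t} with entries at most `bound`."""
    n = len(A[0])
    return [list(x) for x in product(range(bound + 1), repeat=n)
            if [sum(a * v for a, v in zip(row, x)) for row in A] == t]

# Independence model on a 2 x 2 table: the rows of A give row sums and column sums.
A = [[1, 1, 0, 0],
     [0, 0, 1, 1],
     [1, 0, 1, 0],
     [0, 1, 0, 1]]
z = [1, -1, -1, 1]                       # the basic move (1,1)(2,2) - (1,2)(2,1)
z_plus, z_minus = parts(z)
t = [sum(a * v for a, v in zip(row, z_plus)) for row in A]
F = fiber(A, t, bound=sum(z_plus))       # no entry can exceed the table total
indispensable = sorted(F) == sorted([z_plus, z_minus])
```

Here the fiber containing $z^+$ has exactly the two elements $z^+$ and $z^-$, so `indispensable` is true for this move.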

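Suppose that $x$ and $y$ are in the same fiber $\mathcal{F}_t$ and the $l_1$-norm $\|x - y\|_1 = \sum_{(i,j) \in \mathcal{I}} |x_{ij} - y_{ij}|$
is not equal to zero. We say that $\|x - y\|_1$ can be reduced by a subset $\mathcal{B} \subset \mathcal{M}_A$ if there exist
$\tau^+ \ge 0$, $\tau^- \ge 0$, $\tau^+ + \tau^- > 0$, and sequences of moves $z_s^+ \in \mathcal{B}$, $s = 1, \dots, \tau^+$, and $z_s^- \in \mathcal{B}$,
$s = 1, \dots, \tau^-$, satisfying

$$\Big\| x - y + \sum_{s=1}^{\tau^+} z_s^+ + \sum_{s=1}^{\tau^-} z_s^- \Big\|_1 < \|x - y\|_1,$$

$$x + \sum_{s=1}^{\tau'} z_s^+ \in \mathcal{F}_t, \quad \tau' = 1, \dots, \tau^+,$$

$$y - \sum_{s=1}^{\tau'} z_s^- \in \mathcal{F}_t, \quad \tau' = 1, \dots, \tau^-.$$

It is easy to see that a subset $\mathcal{B} \subset \mathcal{M}_A$ is a Markov basis for $A$ if $\|x - y\|_1$ can be reduced by
$\mathcal{B}$ for all $x \neq y$ in every fiber $\mathcal{F}_t$.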

2.2 Two-way subtable sum problem 

In this subsection we introduce two-way subtable sum problems and give a brief review of
previous research.

For a contingency table $x$ denote the row sums and column sums of $x$ by

$$x_{i+} = \sum_{j=1}^{C} x_{ij}, \quad i = 1, \dots, R, \qquad x_{+j} = \sum_{i=1}^{R} x_{ij}, \quad j = 1, \dots, C.$$

Let $S_1, \dots, S_N$ be subsets of $\mathcal{I}$ and define the subtable sums $x_{S_n}$, $n = 1, \dots, N$, by

$$x_{S_n} = \sum_{(i,j) \in S_n} x_{ij}.$$

We summarize the set of row sums, column sums and the subtable sums as a column vector

$$t = (x_{1+}, \dots, x_{R+}, x_{+1}, \dots, x_{+C}, x_{S_1}, \dots, x_{S_N})'.$$

Then, with an appropriate zero-one matrix $A_{S_1,\dots,S_N}$, the relation between $x$ and $t$ is written as

$$A_{S_1,\dots,S_N}\, x = t.$$

The set of columns of $A_{S_1,\dots,S_N}$ is a configuration defining a toric ideal $I_{A_{S_1,\dots,S_N}}$. For simplicity
we call $A_{S_1,\dots,S_N}$ the configuration for $S_1, \dots, S_N$ and abbreviate the set of moves $\mathcal{M}_{A_{S_1,\dots,S_N}}$
for $A_{S_1,\dots,S_N}$ as $\mathcal{M}_{S_1,\dots,S_N}$.
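As an illustration of the configuration just defined, here is a minimal Python sketch, assuming 1-based cell indices and a hypothetical $2 \times 3$ table with a single subtable $S_1 = \{(1,1), (1,2)\}$. The function names `configuration` and `times` are illustrative only. It builds the zero-one matrix, computes $t$ from a table $x$, and checks that a basic move lies in the kernel.

```python
from itertools import product

def cells(R, C):
    """Cells (i, j) of an R x C table in lexicographic order (1-based)."""
    return list(product(range(1, R + 1), range(1, C + 1)))

def configuration(R, C, subtables):
    """Zero-one matrix whose rows give the row sums, column sums and subtable sums."""
    idx = cells(R, C)
    rows = [[1 if i == r else 0 for (i, j) in idx] for r in range(1, R + 1)]
    cols = [[1 if j == c else 0 for (i, j) in idx] for c in range(1, C + 1)]
    subs = [[1 if cell in S else 0 for cell in idx] for S in subtables]
    return rows + cols + subs

def times(A, x):
    """Matrix-vector product A x for a table x flattened in lexicographic order."""
    return [sum(a * v for a, v in zip(row, x)) for row in A]

S1 = {(1, 1), (1, 2)}                # hypothetical subtable
A = configuration(2, 3, [S1])
x = [3, 1, 0,
     2, 0, 4]
t = times(A, x)                      # (x_{1+}, x_{2+}, x_{+1}, x_{+2}, x_{+3}, x_{S_1})
z = [1, -1, 0,
     -1, 1, 0]                       # the basic move (1,1)(2,2) - (1,2)(2,1)
is_move = times(A, z) == [0] * len(A)
```

In this example the move changes two cells of $S_1$ in opposite directions, so it preserves the subtable sum as well as the margins.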

Consider a square-free move of degree 2 (basic move) of the form

         j     j'
   i    +1    -1
   i'   -1    +1

for $i \neq i'$ and $j \neq j'$. For simplicity we denote this move by $(i,j)(i',j') - (i,j')(i',j)$. Similarly
a move of degree $d$ is denoted by $(i_1,j_1) \cdots (i_d,j_d) - (i_2,j_1) \cdots (i_d,j_{d-1})(i_1,j_d)$ for appropriate
$i_1, \dots, i_d$ and $j_1, \dots, j_d$. Hara et al. [8] and [7] discussed Markov bases for the configuration $A_S$
for one subtable $S \subset \mathcal{I}$. In [8] it is shown that the set of basic moves in $\mathcal{M}_S$ is a Markov basis
for $A_S$ if and only if $S$ is either $2 \times 2$ block diagonal or triangular. Ohsugi and Hibi [12] discussed
the same problem from an algebraic viewpoint.
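To make the set of basic moves in $\mathcal{M}_S$ concrete, the following Python sketch enumerates every basic move of an $R \times C$ table and keeps those that leave all subtable sums unchanged. The function name `basic_moves` and the example subtable (the upper-left $2 \times 2$ block of a $3 \times 3$ table) are assumptions made for illustration. Row and column sums are preserved by every basic move, so only the subtable sums need checking.

```python
from itertools import combinations

def basic_moves(R, C, subtables):
    """Basic moves (i,j)(i',j') - (i,j')(i',j), with both signs, that keep every subtable sum fixed.

    Each move is a dict mapping a 1-based cell to +1 or -1.
    """
    moves = []
    for i, i2 in combinations(range(1, R + 1), 2):
        for j, j2 in combinations(range(1, C + 1), 2):
            z = {(i, j): 1, (i2, j2): 1, (i, j2): -1, (i2, j): -1}
            if all(sum(v for cell, v in z.items() if cell in S) == 0 for S in subtables):
                moves.append(z)
                moves.append({cell: -v for cell, v in z.items()})
    return moves

all_basic = basic_moves(3, 3, [])                    # independence model
S = {(1, 1), (1, 2), (2, 1), (2, 2)}                 # hypothetical subtable
basic_in_M_S = basic_moves(3, 3, [S])
```

Without a subtable there are 18 signed basic moves on a $3 \times 3$ table. With this choice of $S$ only 10 of them remain, because a basic move touching exactly one cell of $S$, or two cells of $S$ with the same sign, changes the subtable sum.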

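In Sections 3-5 we use the following notation on signs of subtables. Let $x, y$ be two contingency
tables in the same fiber $\mathcal{F}_t$ and let $z = x - y$. Consider a subset $\mathcal{I}' \subset \mathcal{I}$. If $z_{ij} = 0$ for every
$(i,j) \in \mathcal{I}'$, we denote it by $z(\mathcal{I}') = 0$. If $z_{ij} \ge 0$ for every $(i,j) \in \mathcal{I}'$ and $z_{ij} > 0$ for some $(i,j) \in \mathcal{I}'$, we denote it by
$z(\mathcal{I}') > 0$. Other notations such as $z(\mathcal{I}') \ge 0$, $z(\mathcal{I}') < 0$ and $z(\mathcal{I}') \le 0$ are defined in the same
way.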



3 Markov bases for the change point models 

In this section we derive Markov bases for the configuration arising from the change point model 
of two-way contingency tables and discuss its algebraic properties. 



3.1 The unique minimal Markov bases for the change point models 

In this subsection we derive the unique minimal Markov bases for the change point models. 



A two-way change point model with an unknown change point was introduced in Hirotsu [?]. We
consider two-way change point models with several fixed change points. We call a subtable
$S \subset \mathcal{I}$ a rectangle in $\mathcal{I}$ if $S$ has the form

$$S = \{(i,j) \mid a_1 \le i \le a_2,\ b_1 \le j \le b_2\}$$

for $1 \le a_1 \le a_2 \le R$ and $1 \le b_1 \le b_2 \le C$. Let $S_n$, $n = 1, \dots, N$, be rectangles of $\mathcal{I}$ satisfying
$S_1 \subset S_2 \subset \cdots \subset S_N \subset \mathcal{I}$. Then the change point model is defined by

$$\log p_{ij} = \mu + \alpha_i + \beta_j + \sum_{n=1}^{N} \gamma_n I_{S_n}(i,j), \tag{1}$$

where $I_{S_n}(i,j) = 1$ if $(i,j) \in S_n$ and $I_{S_n}(i,j) = 0$ otherwise. The sufficient statistic for this
model consists of the row sums, the column sums and the sums of frequencies in $S_n$, $n = 1, \dots, N$.
The first main result of this paper is stated as follows.

Theorem 1. Let $S_n$, $n = 1, \dots, N$, be rectangles of $\mathcal{I}$ with $S_1 \subset S_2 \subset \cdots \subset S_N \subset \mathcal{I}$. The set
of basic moves in $\mathcal{M}_{S_1,\dots,S_N}$ is the unique minimal Markov basis for the configuration $A_{S_1,\dots,S_N}$.

Proof. After an appropriate interchange of rows and columns, we may assume that $S_n$, $n = 1, \dots, N$,
are the subsets of $\mathcal{I}$ defined by

$$S_n = \{(i,j) \mid 1 \le i \le r_n,\ 1 \le j \le c_n\}, \quad n = 1, \dots, N,$$

where each $r_n$, $c_n$, $n = 1, \dots, N$, is a positive integer with $1 \le r_1 \le \cdots \le r_N \le R$ and
$1 \le c_1 \le \cdots \le c_N \le C$. Since $S_1 \subset S_2 \subset \cdots \subset S_N \subset \mathcal{I}$, at least one of $r_n < r_{n+1}$ or $c_n < c_{n+1}$ holds for
each $n = 1, \dots, N - 1$ and at least one of $r_N < R$ or $c_N < C$ holds.

Suppose $z(S_1) \neq 0$. Then $z$ contains both a positive cell and a negative cell in $S_1$. Denote
these cells by $(i,j)$ and $(i',j')$. If $j = j'$, letting $(i,j'')$ be a negative cell in the $i$-th row,
$\|z\|_1$ can be reduced by a basic move $(i,j)(i',j'') - (i',j)(i,j'') \in \mathcal{M}_{S_1,\dots,S_N}$. Let us consider
the case of $i \neq i'$ and $j \neq j'$. Let $(i,j'')$ and $(i'',j)$ be negative cells in the $i$-th row and in
the $j$-th column, respectively. Then $\|z\|_1$ can be reduced by a sequence of two basic moves
$(i,j')(i',j'') - (i',j')(i,j'')$ and $(i,j)(i'',j') - (i'',j)(i,j')$. For the case of $r_N < R$ and $c_N < C$,
it can be shown by the same argument that if $z$ contains both a positive cell and a negative cell
in $\bar{\mathcal{I}} := \{(i,j) \mid r_N < i \le R,\ c_N < j \le C\}$, then $\|z\|_1$ can be reduced by the set of basic moves
in $\mathcal{M}_{S_1,\dots,S_N}$. If $r_N = R$ or $c_N = C$ holds, say $r_N = R$, $z$ contains both a positive cell and a
negative cell in $\mathcal{I} \setminus S_N$. Then $\|z\|_1$ can be reduced by the set of basic moves in $\mathcal{M}_{S_1,\dots,S_N}$.

Consider the case of $z(S_1) = 0$. We claim that if $z(S_{n-1}) = 0$ and $z(S_n) \neq 0$ for some $n$, $1 < n \le N$,
then $\|z\|_1$ can be reduced by the set of basic moves in $\mathcal{M}_{S_1,\dots,S_N}$. If either $r_{n-1} = r_n$ or $c_{n-1} = c_n$
holds, we see that $\|z\|_1$ can be reduced by a basic move in $\mathcal{M}_{S_1,\dots,S_N}$. For the case of $r_{n-1} < r_n$
and $c_{n-1} < c_n$, let $S_n^{12} = \{(i,j) \mid 1 \le i \le r_{n-1},\ c_{n-1} < j \le c_n\}$, $S_n^{21} = \{(i,j) \mid r_{n-1} < i \le r_n,\ 1 \le j \le c_{n-1}\}$
and $S_n^{22} = \{(i,j) \mid r_{n-1} < i \le r_n,\ c_{n-1} < j \le c_n\}$. If $z$ contains both a
positive cell and a negative cell in one of $S_n^{12}$, $S_n^{21}$ or $S_n^{22}$, it can be similarly shown that $\|z\|_1$ can
be reduced by the set of basic moves in $\mathcal{M}_{S_1,\dots,S_N}$. Then we only need to consider the case of
$z(S_n^{kl}) \ge 0$ or $z(S_n^{kl}) \le 0$ for each $(k,l) \in \{(1,2), (2,1), (2,2)\}$. Without loss of generality we
can assume $z(S_n^{12}) > 0$. Let $(i,j)$ be a positive cell in $S_n^{12}$ and take a negative cell in the
$j$-th column. If this negative cell belongs to $S_n^{22}$, then, using a negative cell in the $i$-th row, $\|z\|_1$ can be reduced
by a basic move in $\mathcal{M}_{S_1,\dots,S_N}$. Suppose that it does not belong to $S_n^{22}$. There exists a
negative cell in $S_n^{21} \cup S_n^{22}$. Then $\|z\|_1$ can be reduced by a sequence of two basic moves
in $\mathcal{M}_{S_1,\dots,S_N}$. Therefore the claim is proved.

The remaining part is the case that $z(S_N) = 0$ and one of $z(\bar{\mathcal{I}}) \ge 0$ or $z(\bar{\mathcal{I}}) \le 0$ holds. If
$z(\bar{\mathcal{I}}) = 0$, $z$ contains a nonzero cell in $\{(i,j) \mid r_N < i \le R,\ 1 \le j \le c_N\}$ or $\{(i,j) \mid 1 \le i \le r_N,\ c_N < j \le C\}$.
It is easy to see that $\|z\|_1$ can be reduced by a basic move in $\mathcal{M}_{S_1,\dots,S_N}$.
Suppose $z(\bar{\mathcal{I}}) > 0$ and let $(i,j)$ be a positive cell in $\bar{\mathcal{I}}$. There exist a negative cell $(i,j')$ and a
positive cell $(i',j')$ with $i' \neq i$ in $\{(i,j') \mid r_N < i \le R,\ 1 \le j' \le c_N\}$. Then $\|z\|_1$ can be reduced
by a basic move $(i,j)(i',j') - (i',j)(i,j')$.

Since every basic move in $\mathcal{M}_{S_1,\dots,S_N}$ is indispensable, the set of basic moves in $\mathcal{M}_{S_1,\dots,S_N}$ is
the unique minimal Markov basis for $A_{S_1,\dots,S_N}$ (see [?]). □
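The statement of Theorem 1 can be illustrated numerically on a small fiber. The sketch below is not a proof and is not taken from the paper. It uses a hypothetical $3 \times 3$ example with nested rectangles $S_1 = \{(1,1)\}$ and $S_2$ equal to the upper-left $2 \times 2$ block (written 0-based in the code), enumerates one fiber by brute force, and checks by breadth-first search that the basic moves in $\mathcal{M}_{S_1,S_2}$ connect it.

```python
from collections import deque
from itertools import combinations, product

R, C = 3, 3
CELLS = list(product(range(R), range(C)))          # 0-based cells, lexicographic order
S1 = {(0, 0)}                                      # hypothetical nested rectangles
S2 = {(0, 0), (0, 1), (1, 0), (1, 1)}
SUBTABLES = [S1, S2]

def suff_stat(x):
    """Row sums, column sums and subtable sums of a flattened table x."""
    rows = [sum(v for (i, _), v in zip(CELLS, x) if i == r) for r in range(R)]
    cols = [sum(v for (_, j), v in zip(CELLS, x) if j == c) for c in range(C)]
    subs = [sum(v for cell, v in zip(CELLS, x) if cell in S) for S in SUBTABLES]
    return tuple(rows + cols + subs)

def basic_moves():
    """Signed basic moves whose sufficient statistic is zero."""
    moves = []
    for i, i2 in combinations(range(R), 2):
        for j, j2 in combinations(range(C), 2):
            plus, minus = {(i, j), (i2, j2)}, {(i, j2), (i2, j)}
            z = tuple((c in plus) - (c in minus) for c in CELLS)
            if not any(suff_stat(z)):
                moves += [z, tuple(-v for v in z)]
    return moves

def fiber_of(x):
    """Brute-force fiber; no entry can exceed the largest row sum."""
    t = suff_stat(x)
    bound = max(t[:R])
    return {y for y in product(range(bound + 1), repeat=R * C) if suff_stat(y) == t}

def connected(fib, moves):
    """Breadth-first search over the fiber using the given moves."""
    start = next(iter(fib))
    seen, queue = {start}, deque([start])
    while queue:
        x = queue.popleft()
        for z in moves:
            y = tuple(a + b for a, b in zip(x, z))
            if y in fib and y not in seen:
                seen.add(y)
                queue.append(y)
    return seen == fib

x0 = (1, 1, 0,
      1, 0, 1,
      0, 1, 1)
moves = basic_moves()
fib = fiber_of(x0)
is_connected = connected(fib, moves)
```

For this table the fiber has four elements and only two basic moves (up to sign) lie in $\mathcal{M}_{S_1,S_2}$, which is enough to connect it. Brute-force enumeration grows exponentially with the table size, so this check is only practical for toy examples.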



3.2 Algebraic properties of the configuration arising from the change point 
model 

In this subsection we investigate the algebraic properties of the configuration $A_{S_1,\dots,S_N}$ arising
from the change point model.

Let $K$ be a field and let $K[\{u_i\}_{1 \le i \le R} \cup \{v_j\}_{1 \le j \le C} \cup \{w_n\}_{1 \le n \le N+1}]$ be a polynomial ring in
$R + C + N + 1$ variables over $K$. We associate each cell $(i,j) \in S_n \setminus S_{n-1}$, $1 \le n \le N + 1$, to a
monomial $u_i v_j w_n$ where $S_0 = \emptyset$ and $S_{N+1} = \mathcal{I}$. Define $R_{S_1,\dots,S_N}$ as the semigroup ring generated
by those monomials. Let $K[x] = K[\{x_{ij}\}_{(i,j) \in \mathcal{I}}]$ be a polynomial ring in $RC$ variables over $K$.
Define a surjective map $\pi : K[x] \to R_{S_1,\dots,S_N}$ by $\pi(x_{ij}) = u_i v_j w_n$ for $(i,j) \in S_n \setminus S_{n-1}$, $1 \le n \le N + 1$. Define
the toric ideal for the change point model as the kernel of $\pi$ and denote it by $I_{S_1,\dots,S_N}$. See [14]
and [3] for general facts on toric ideals and their Gröbner bases. From Theorem 1 we already
know that the toric ideal $I_{S_1,\dots,S_N}$ is generated by the quadratic binomials corresponding to
basic moves in $\mathcal{M}_{S_1,\dots,S_N}$. Furthermore we have the following Theorem 2. Although its proof
is similar to that in [12], we need to use a lexicographic order different from the order used in [12].
In fact the toric ideal $I_{S_1,\dots,S_N}$ does not have a quadratic Gröbner basis with respect to the
lexicographic order used in [12] if $N \ge 2$ and there exist $m, n$ such that $2 \le m < n \le N + 1$ and
$r_{m-1} < r_m$, $c_{m-1} < c_m$, $r_{n-1} < r_n$, $c_{n-1} < c_n$, where $r_{N+1} = R$ and $c_{N+1} = C$.
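Membership of a binomial in the kernel of $\pi$ can be tested directly from the definition: a binomial $x^a - x^b$ lies in the toric ideal exactly when both monomials have the same image under $\pi$. The Python sketch below illustrates this with hypothetical nested rectangles in a $3 \times 3$ table. The function names are illustrative, and a monomial is represented as a list of 1-based cells.

```python
from collections import Counter

def layer(cell, nested):
    """Index n such that the cell lies in S_n but not in S_{n-1}; cells outside every S_n get N + 1."""
    for n, S in enumerate(nested, start=1):
        if cell in S:
            return n
    return len(nested) + 1

def image(monomial, nested):
    """Image under pi of a monomial (list of cells), as a multiset of the variables u_i, v_j, w_n."""
    out = Counter()
    for (i, j) in monomial:
        out[("u", i)] += 1
        out[("v", j)] += 1
        out[("w", layer((i, j), nested))] += 1
    return out

def in_toric_ideal(m1, m2, nested):
    """True if the binomial m1 - m2 is mapped to zero by pi."""
    return image(m1, nested) == image(m2, nested)

# Hypothetical nested rectangles S_1 in S_2 in a 3 x 3 table.
S1 = {(1, 1)}
S2 = {(1, 1), (1, 2), (2, 1), (2, 2)}
nested = [S1, S2]

inside = in_toric_ideal([(1, 2), (2, 3)], [(1, 3), (2, 2)], nested)
outside = in_toric_ideal([(1, 1), (2, 2)], [(1, 2), (2, 1)], nested)
```

The first binomial corresponds to a basic move that preserves both subtable sums and lies in the ideal. The second changes the count in $S_1$, so its two monomials have different $w$-degrees and it does not.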



Theorem 2. For the toric ideal $I_{S_1,\dots,S_N}$ the following statements hold:

(i) $I_{S_1,\dots,S_N}$ possesses a quadratic Gröbner basis;

(ii) $I_{S_1,\dots,S_N}$ possesses a square-free initial ideal;

(iii) $R_{S_1,\dots,S_N}$ is normal;

(iv) $R_{S_1,\dots,S_N}$ is Koszul.

Proof. In general, (i) $\Rightarrow$ (iv) and (ii) $\Rightarrow$ (iii) hold. Since $R_{S_1,\dots,S_N}$ is generated by monomials
of the same degree, (i) $\Rightarrow$ (ii) holds from the proof of Proposition 1.6 in Ohsugi and Hibi [11].
Therefore it suffices to show that statement (i) holds.

By an appropriate interchange of rows and columns, we may assume that $S_n$, $n = 1, \dots, N$,
share their upper-left corners. From Theorem 1 the toric ideal $I_{S_1,\dots,S_N}$ is generated by

$$\mathcal{G} = \{x_{ik}x_{jl} - x_{il}x_{jk} \mid 1 \le i < j \le R,\ 1 \le k < l \le C,\ \pi(x_{ik}x_{jl}) = \pi(x_{il}x_{jk})\}.$$

Fix a lexicographic order $\succ$ satisfying $x_{RC} \succ x_{R,C-1} \succ \cdots \succ x_{R1} \succ x_{R-1,C} \succ \cdots \succ x_{11}$. Then
$x_{ik}x_{jl}$ is the initial monomial of $x_{ik}x_{jl} - x_{il}x_{jk}$, $1 \le i < j \le R$, $1 \le k < l \le C$. We prove that $\mathcal{G}$
is a Gröbner basis of $I_{S_1,\dots,S_N}$ with respect to $\succ$ using Buchberger's criterion.

Let $f$ be the $S$-polynomial of $g_1, g_2 \in \mathcal{G}$. Suppose that $f$ is not reduced to zero by $\mathcal{G}$.
By Proposition 4 in Section 9 of Chapter 2 in [?], the initial monomials of $g_1$ and $g_2$ are not
relatively prime. On the other hand, if the monomials of $f$ share a common variable, $f$ is reduced
to zero by $\mathcal{G}$. Then $f$ is a cubic binomial and is represented as $f = x_{i_1 l_1}x_{i_2 l_2}x_{i_3 l_3} - x_{i'_1 l'_1}x_{i'_2 l'_2}x_{i'_3 l'_3}$
with the initial monomial $x_{i_1 l_1}x_{i_2 l_2}x_{i_3 l_3}$. Since $f \in I_{S_1,\dots,S_N}$, we have $\{i_1, i_2, i_3\} = \{i'_1, i'_2, i'_3\}$
and $\{l_1, l_2, l_3\} = \{l'_1, l'_2, l'_3\}$. Since the monomials of $f$ have no common variable, $|\{i_1, i_2, i_3\}| =
|\{l_1, l_2, l_3\}| = 3$. We assume $1 \le i_1 = i'_1 < i_2 = i'_2 < i_3 = i'_3 \le R$ without loss of generality. By
the definition of $\succ$, $l_3 > l'_3 \in \{l_1, l_2\}$. Then $f$ is represented as one of the following forms:

(1) $x_{i_1 j_1}x_{i_2 j_2}x_{i_3 j_3} - x_{i_1 j_2}x_{i_2 j_3}x_{i_3 j_1}$

(2) $x_{i_1 j_1}x_{i_2 j_2}x_{i_3 j_3} - x_{i_1 j_3}x_{i_2 j_1}x_{i_3 j_2}$

(3) $x_{i_1 j_1}x_{i_2 j_3}x_{i_3 j_2} - x_{i_1 j_3}x_{i_2 j_2}x_{i_3 j_1}$

(4) $x_{i_1 j_2}x_{i_2 j_1}x_{i_3 j_3} - x_{i_1 j_1}x_{i_2 j_3}x_{i_3 j_2}$

(5) $x_{i_1 j_2}x_{i_2 j_1}x_{i_3 j_3} - x_{i_1 j_3}x_{i_2 j_2}x_{i_3 j_1}$

(6) $x_{i_1 j_3}x_{i_2 j_1}x_{i_3 j_2} - x_{i_1 j_2}x_{i_2 j_3}x_{i_3 j_1}$

where $1 \le i_1 < i_2 < i_3 \le R$ and $1 \le j_1 < j_2 < j_3 \le C$. The candidates (1)-(6) of the form of $f$
are obtained as follows: Suppose $l_1 < l_2 < l_3$ and $l'_3 = l_1$. By $l_2 \neq l'_2$ we have $(l'_1, l'_2) = (l_2, l_3)$.
This implies that $f$ corresponds to the type (1). The forms (2)-(6) are obtained by the same
argument.

For each form of (1)-(6), if there exists a quadratic binomial $f'$ such that $\mathrm{in}_\succ(f')$ divides
$\mathrm{in}_\succ(f)$, then $f$ can be reduced to a cubic binomial whose two monomials share a common
variable. Hence such $f'$ does not belong to $\mathcal{G}$. We derive a contradiction for each form of $f$.
Note that $x_{ik}x_{jl} - x_{il}x_{jk} \notin \mathcal{G}$, $1 \le i < j \le R$, $1 \le k < l \le C$, is equivalent to the existence
of $n$, $1 \le n \le N$, such that $(i,k) \in S_n \setminus S_{n-1}$ and $(j,k), (i,l) \notin S_n$. We refer to this
equivalence by (*).

(1) Consider a quadratic binomial $f' = x_{i_1 j_1}x_{i_2 j_2} - x_{i_1 j_2}x_{i_2 j_1}$ and let $S_n \setminus S_{n-1}$ be a subtable
containing $(i_1, j_1)$. Since $f' \notin \mathcal{G}$ and (*), $(i_1, j_2), (i_2, j_3), (i_3, j_1) \notin S_n$. This contradicts
$f \in I_{S_1,\dots,S_N}$.

(2) Consider a quadratic binomial $f' = x_{i_1 j_1}x_{i_2 j_2} - x_{i_1 j_2}x_{i_2 j_1}$ and let $S_n \setminus S_{n-1}$ be a subtable
containing $(i_1, j_1)$. Since $f' \notin \mathcal{G}$ and (*), $(i_1, j_3), (i_2, j_1), (i_3, j_2) \notin S_n$. This contradicts
$f \in I_{S_1,\dots,S_N}$.

(3) Consider two quadratic binomials $f_1 = x_{i_1 j_1}x_{i_2 j_3} - x_{i_1 j_3}x_{i_2 j_1}$ and $f_2 = x_{i_1 j_1}x_{i_3 j_2} - x_{i_1 j_2}x_{i_3 j_1}$.
Let $S_n \setminus S_{n-1}$ be a subtable containing $(i_1, j_1)$. Since $f_1 \notin \mathcal{G}$ and (*), $(i_2, j_1) \notin S_n$.
Since $f_2 \notin \mathcal{G}$ and (*), $(i_1, j_2) \notin S_n$. Then $(i_1, j_3), (i_2, j_2), (i_3, j_1) \notin S_n$, which contradicts
$f \in I_{S_1,\dots,S_N}$.

(4) Consider two quadratic binomials $f_1 = x_{i_2 j_1}x_{i_3 j_3} - x_{i_2 j_3}x_{i_3 j_1}$ and $f_2 = x_{i_1 j_2}x_{i_3 j_3} - x_{i_1 j_3}x_{i_3 j_2}$.
Let $S_n \setminus S_{n-1}$ be a subtable containing $(i_2, j_1)$. Since $f_1 \notin \mathcal{G}$, (*) and $f \in I_{S_1,\dots,S_N}$,
$(i_1, j_1) \in S_n \setminus S_{n-1}$. Similarly $(i_1, j_2) \in S_n \setminus S_{n-1}$ follows from $f_2 \notin \mathcal{G}$, (*) and $f \in I_{S_1,\dots,S_N}$.
These contradict $f \in I_{S_1,\dots,S_N}$.

(5) Consider two quadratic binomials $f_1 = x_{i_2 j_1}x_{i_3 j_3} - x_{i_2 j_3}x_{i_3 j_1}$ and $f_2 = x_{i_1 j_2}x_{i_3 j_3} - x_{i_1 j_3}x_{i_3 j_2}$.
Let $S_n \setminus S_{n-1}$ be a subtable containing $(i_2, j_1)$. Since $f_1 \notin \mathcal{G}$ and (*), $(i_2, j_3), (i_3, j_1) \notin S_n$
and $(i_1, j_3) \notin S_n$. Since $f \in I_{S_1,\dots,S_N}$, $S_n \setminus S_{n-1}$ contains $(i_2, j_2)$. Similarly $(i_1, j_2) \in S_n \setminus S_{n-1}$
follows from $f_2 \notin \mathcal{G}$, (*) and $f \in I_{S_1,\dots,S_N}$. This contradicts $f \in I_{S_1,\dots,S_N}$.

(6) Consider a quadratic binomial $f' = x_{i_2 j_1}x_{i_3 j_2} - x_{i_2 j_2}x_{i_3 j_1}$ and let $S_n \setminus S_{n-1}$ be a subtable
containing $(i_2, j_1)$. Since $f' \notin \mathcal{G}$ and (*), $(i_1, j_2), (i_2, j_3), (i_3, j_1) \notin S_n$. This contradicts
$f \in I_{S_1,\dots,S_N}$.

Therefore $\mathcal{G}$ is a Gröbner basis of $I_{S_1,\dots,S_N}$ with respect to $\succ$. □



4 Markov bases for common block diagonal effect models 

In this section we introduce the common block diagonal effect model of two-way contingency tables
and derive its Markov basis.

Let $S$ denote the set of cells belonging to the diagonal blocks defined by

$$S = \{(i,j) \mid r_n \le i < r_{n+1},\ c_n \le j < c_{n+1} \text{ for some } n,\ 1 \le n \le N\},$$

where each $r_n$, $c_n$, $n = 1, \dots, N + 1$, is a nonnegative integer with $1 = r_1 < r_2 < \cdots < r_{N+1} = R + 1$
and $1 = c_1 < c_2 < \cdots < c_{N+1} = C + 1$. $S$ is an $N \times N$ block diagonal set in the contingency
table. In the common block diagonal effect model, the cell probabilities $\{p_{ij}\}$ are defined by

$$\log p_{ij} = \mu + \alpha_i + \beta_j + \gamma_S I_S(i,j). \tag{2}$$

In the model (2), all cells in diagonal blocks have the same parameter $\gamma_S$. The sufficient statistic
for (2) consists of the row sums, the column sums and the sum of frequencies in $S$. Note that
the model (2) is a generalization of the common diagonal effect model whose Markov basis is
discussed in Hara et al. [?].

Since Hara et al. [?] showed that for $N = 2$ the set of basic moves in $\mathcal{M}_S$ is a Markov basis
for $A_S$, we assume $N \ge 3$ in this section. In order to describe a Markov basis for $A_S$ we need
some more notation. We index each block as in Figure 1, i.e., $\mathcal{I}_{kl} = \{(i,j) \mid r_k \le i < r_{k+1},\ c_l \le j < c_{l+1}\}$
for $1 \le k, l \le N$. Note that $S$ can be represented as $S = \mathcal{I}_{11} \cup \mathcal{I}_{22} \cup \cdots \cup \mathcal{I}_{NN}$.

Figure 1: Block-wise indices.

Consider the following types of moves.

• Type I (square-free moves of degree 2):

          j_1   j_2
   i_1    +1    -1
   i_2    -1    +1

where $i_1 \neq i_2$ and $j_1 \neq j_2$.

• Type II (square-free moves of degree 3):

          j_1   j_2   j_3
   i_1    +1    -1     0
   i_2    -1     0    +1
   i_3     0    +1    -1

where nonzero cells (i.e. ±1) belong to distinct blocks in $S^c$.

• Type III (square-free moves of degree 3):

          j_1   j_2   j_3
   i_1    +1     0    -1
   i_2     0    -1    +1
   i_3    -1    +1     0

where $(i_1, j_1)$ and $(i_2, j_2)$ belong to distinct blocks in $S$ and other nonzero cells belong to
distinct blocks in $S^c$.

• Type IV (moves of degree 4):

          j_1   j_2   j_3   j_4
   i_1    +1     0    -1     0
   i_2     0    +1     0    -1
   i_3     0    -1    +1     0
   i_4    -1     0     0    +1

where $(i_1, j_1)$ and $(i_3, j_2)$ belong to distinct blocks in $S$ and other nonzero cells belong to
(not necessarily distinct) blocks in $S^c$. The $i_1$-th and $i_2$-th rows belong to the same block
of rows. Similarly the $i_3$-th and $i_4$-th rows belong to the same block of rows. There are
both square-free and non-square-free moves of this type. Type IV includes the transpose
of these moves.

We now give a Markov basis for $A_S$ in explicit form as follows.

Theorem 3. The set of moves of Types I-IV in $\mathcal{M}_S$ forms a Markov basis for the configuration
$A_S$.
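A small computation shows why moves beyond Type I are needed once $N \ge 3$. The Python sketch below is an illustration only: it takes the smallest case, a $3 \times 3$ table whose diagonal blocks are single cells (an assumption made to keep the example tiny), and checks that no basic move preserves the diagonal sum, while a degree 3 move with all nonzero cells off the diagonal connects a two-element fiber.

```python
from itertools import combinations, product

R = C = 3
CELLS = list(product(range(R), range(C)))          # 0-based cells, lexicographic order
S = {(0, 0), (1, 1), (2, 2)}                       # 1 x 1 diagonal blocks, N = 3 (assumed example)

def suff_stat(x):
    """Row sums, column sums and the sum over S of a flattened table x."""
    rows = [sum(v for (i, _), v in zip(CELLS, x) if i == r) for r in range(R)]
    cols = [sum(v for (_, j), v in zip(CELLS, x) if j == c) for c in range(C)]
    diag = sum(v for cell, v in zip(CELLS, x) if cell in S)
    return tuple(rows + cols + [diag])

# Basic (Type I) moves that keep the sufficient statistic fixed.
basic = []
for i, i2 in combinations(range(R), 2):
    for j, j2 in combinations(range(C), 2):
        plus, minus = {(i, j), (i2, j2)}, {(i, j2), (i2, j)}
        z = tuple((c in plus) - (c in minus) for c in CELLS)
        if not any(suff_stat(z)):
            basic += [z, tuple(-v for v in z)]

# A square-free move of degree 3 supported on off-diagonal cells (Type II pattern).
type2 = (0, 1, -1,
         -1, 0, 1,
         1, -1, 0)
z_plus = tuple(max(v, 0) for v in type2)
z_minus = tuple(max(-v, 0) for v in type2)
fib = {x for x in product(range(2), repeat=R * C) if suff_stat(x) == suff_stat(z_plus)}
```

Here `basic` is empty, yet the fiber of $z^+$ contains the two tables $z^+$ and $z^-$, so only the degree 3 move can connect them. Larger blocks admit Type I moves, but the same obstruction appears between blocks.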

We establish Theorem 3 by the lemmas below. Suppose that $x$ and $y$ belong to the same
fiber $\mathcal{F}_t$ and let $z = x - y$. The first lemma is proved by the same argument as Lemma 2 of [?]
and we omit its proof.

Lemma 1. Suppose that $z$ contains a block $\mathcal{I}_{kl}$ such that there exist two cells $(i,j), (i',j') \in \mathcal{I}_{kl}$
with $z_{ij} > 0$ and $z_{i'j'} < 0$. Then $\|z\|_1$ can be reduced by moves of Type I in $\mathcal{M}_S$.

By this lemma from now on we assume that every block $\mathcal{I}_{kl}$ in $z$, $1 \le k, l \le N$, satisfies
$z(\mathcal{I}_{kl}) \ge 0$ or $z(\mathcal{I}_{kl}) \le 0$. Let $K = \{k \mid 1 \le k \le N,\ z(\mathcal{I}_{kk}) > 0\}$ denote the set of indices of
positive diagonal blocks and let $L = \{l \mid 1 \le l \le N,\ z(\mathcal{I}_{ll}) < 0\}$ denote the set of indices of
negative diagonal blocks.

Lemma 2. If $z(S) = 0$, then $\|z\|_1$ can be reduced by a move of Type I or II in $\mathcal{M}_S$.

Proof. After an appropriate block-wise interchange of rows and columns, we assume $z(\mathcal{I}_{12}) > 0$
and $\mathcal{I}_{11}, \mathcal{I}_{22} \subset S$ without loss of generality. Let $(i,j)$ be a positive cell in $\mathcal{I}_{12}$ as shown in Figure
2. Then $z$ contains negative cells in the $i$-th row and the $j$-th column. By an appropriate
block-wise interchange of rows and columns, we assume that two of these cells belong to $\mathcal{I}_{13}$ and
$\mathcal{I}_{32}$, respectively. Note that $\mathcal{I}_{33}$ may or may not be contained in $S$. Denote the two negative
cells by $(i,j')$ and $(i',j)$. If $\mathcal{I}_{33} \not\subset S$, $\|z\|_1$ can be reduced by $(i,j)(i',j') - (i',j)(i,j')$. Hence let
us consider the case of $\mathcal{I}_{33} \subset S$. Since $z_{+j'} = 0$, the $j'$-th column contains a positive cell. By an
appropriate block-wise interchange of rows, we assume that this positive cell belongs to $\mathcal{I}_{23}$ as
in Figure 3. Here, $\mathcal{I}_{22}$ may or may not be contained in $S$. Denote the positive cell by $(i'',j')$. If
$\mathcal{I}_{22} \not\subset S$, $\|z\|_1$ can be reduced by $(i,j)(i'',j') - (i'',j)(i,j')$. If $\mathcal{I}_{22} \subset S$, there exists a negative cell
$(i'',j'') \in \mathcal{I}_{2l}$, $l \neq 2, 3$, and $\|z\|_1$ can be reduced by $(i,j)(i',j'')(i'',j') - (i',j)(i'',j'')(i,j')$. □

Figure 2: $z$ with $z(S) = 0$.

Figure 3: $z$ with $z(S) = 0$.

Lemma 3. Suppose that $z(\mathcal{I}_{kk}) > 0$, $z(\mathcal{I}_{ll}) < 0$ and $z$ contains a cell $(i,j) \in \mathcal{I}_{kk}$ with $z_{ij} > 0$
such that

$$z_{i'j} \ge 0 \ \text{ for all } i' \text{ with } r_l \le i' < r_{l+1} \quad \text{and} \quad z_{ij'} \ge 0 \ \text{ for all } j' \text{ with } c_l \le j' < c_{l+1}.$$

Then $\|z\|_1$ can be reduced by a move of Type III in $\mathcal{M}_S$.

Proof. After an appropriate block-wise interchange of rows and columns, we assume that $k = 1$
and $l = 2$. Let $(i',j')$ be a negative cell in $\mathcal{I}_{ll}$. Since $z_{ij} > 0$ and $z_{i+} = z_{+j} = 0$, there exist
two negative cells $(i'',j)$ and $(i,j'')$ with $(i'',j) \notin \mathcal{I}_{kk}, \mathcal{I}_{lk}$ and $(i,j'') \notin \mathcal{I}_{kk}, \mathcal{I}_{kl}$ as in Figure 4. Hence,
$\|z\|_1$ can be reduced by $(i,j)(i'',j')(i',j'') - (i'',j)(i',j')(i,j'')$. □



Figure 4: $z$ with the condition of Lemma 3.

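Lemma 4. Suppose that $z_{ij} > 0$, $(i,j) \in \mathcal{I}_{kk}$ and there exists $l \in L$ satisfying

$$z_{i'j} < 0 \ \text{ for some } i' \text{ with } r_l \le i' < r_{l+1} \quad \text{and} \quad z(\mathcal{I}_{kl}) > 0$$

or

$$z_{ij'} < 0 \ \text{ for some } j' \text{ with } c_l \le j' < c_{l+1} \quad \text{and} \quad z(\mathcal{I}_{lk}) > 0.$$

Then $\|z\|_1$ can be reduced by a move of Type III or IV in $\mathcal{M}_S$.

Proof. After an appropriate block-wise interchange of rows and columns, we assume that $k = 1$,
$l = 2$, $z_{i'j} < 0$ for some $i'$ with $r_2 \le i' < r_3$, and $z(\mathcal{I}_{12}) > 0$. If there exists a pair of cells $(i_1,j') \in \mathcal{I}_{12}$ and
$(i_2,j') \in \mathcal{I}_{22}$ with $z_{i_1 j'} > 0$, $z_{i_2 j'} < 0$ as in Figure 5, $\|z\|_1$ can be reduced by
$(i,j)(i',j_2)(i_1,j')(i_2,j_1) - (i',j)(i_1,j_2)(i_2,j')(i,j_1)$. If there exists no such pair, $z$ satisfies $z(\mathcal{I}_{12}) > 0$, $z(\mathcal{I}_{22}) < 0$ and
there exists a cell $(i',j') \in \mathcal{I}_{22}$ with $-z_{i'j'} > 0$ such that $-z_{i''j'} \ge 0$ for all $i''$ with $r_1 \le i'' < r_2$ and
$-z_{i'j''} \ge 0$ for all $j''$ with $c_1 \le j'' < c_2$. Hence, by Lemma 3, $\|z\|_1$ can be reduced by a move of Type III. □

Figure 5: $z$ with the condition of Lemma 4.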
Proof of Theorem 3. By the preceding lemmas it is enough to show that $\|z\|_1$ can be reduced by
a move of Types I-IV under the following conditions:

• $z(\mathcal{I}_{kl}) \ge 0$ or $z(\mathcal{I}_{kl}) \le 0$ holds for all $1 \le k, l \le N$ and $z(S) \neq 0$.

• For every $k \in K$, $l \in L$ and $(i,j) \in \mathcal{I}_{kk}$ with $z_{ij} > 0$,

$$z_{i'j} \ge 0 \ \text{ for all } i' \text{ with } r_l \le i' < r_{l+1}, \quad \text{or} \quad z(\mathcal{I}_{kl}) \le 0,$$

and

$$z_{ij'} \ge 0 \ \text{ for all } j' \text{ with } c_l \le j' < c_{l+1}, \quad \text{or} \quad z(\mathcal{I}_{lk}) \le 0.$$

For such $z$ fix $k \in K$ and $l \in L$ and consider the case that the above conditions are satisfied
by $z(\mathcal{I}_{kl}) \le 0$ and $z(\mathcal{I}_{lk}) \le 0$ as in (a) of Figure 6. In Figure 6 we assume that $k = 1$ and
$l = 2$ without loss of generality. In this case $\|z\|_1$ can be reduced by a move of Type III from
the sign-reverse case of Lemma 3. Finally, consider the case that at least one of $z(\mathcal{I}_{kl}) \le 0$ and
$z(\mathcal{I}_{lk}) \le 0$ does not hold. Let $z(\mathcal{I}_{kl}) > 0$ as in (b) of Figure 6. It is obvious from Lemma 3 that
$\|z\|_1$ can be reduced by a move of Type III. □

Figure 6: $z$ with the conditions in the proof of Theorem 3. Panels: (a) $z(\mathcal{I}_{kl}), z(\mathcal{I}_{lk}) \le 0$; (b) $z(\mathcal{I}_{kl}) > 0$.

5 Markov bases for general block diagonal effect models 

In the common block diagonal effect model we assume that every diagonal block has the common
parameter $\gamma_S$. In this section we discuss the case that each diagonal block $S_n$ has its own
parameter $\gamma_n$ and more general cases of block diagonal effects.

We introduce the following model for block diagonal effects by a slight modification of (2).
Let $S_n$, $n = 1, \dots, N$, be the set of cells belonging to the $n$-th diagonal block defined as

$$S_n = \{(i,j) \mid r_n \le i < r_{n+1},\ c_n \le j < c_{n+1}\}.$$

Then the block diagonal effect model with block-wise parameters $\gamma_n$, $n = 1, \dots, N$, is defined by

$$\log p_{ij} = \mu + \alpha_i + \beta_j + \sum_{n=1}^{N} \gamma_n I_{S_n}(i,j). \tag{3}$$

Note that the model (3) contains the quasi-independence model considered in Hara et al. [?] as
a special case.

The sufficient statistic for the model (3) consists of the row sums, column sums and the sums
of frequencies in the block diagonal sets $S_n$, $n = 1, \dots, N$, and is summarized as

$$t = (x_{1+}, \dots, x_{R+}, x_{+1}, \dots, x_{+C}, x_{S_1}, \dots, x_{S_N})'.$$

Then a Markov basis for $A_{S_1,\dots,S_N}$ is obtained by essentially the same arguments as in the proof of
Theorem 3.

Proposition 1. If $N = 2$, the set of moves of Type I in $\mathcal{M}_{S_1,\dots,S_N}$ forms the unique minimal
Markov basis for the configuration $A_{S_1,\dots,S_N}$. If $N \ge 3$, the set of moves of Types I and II in
$\mathcal{M}_{S_1,\dots,S_N}$ forms the unique minimal Markov basis for the configuration $A_{S_1,\dots,S_N}$.

Proof. When $N = 2$, it is easy to see that fixing the sums in $t$ is equivalent to fixing the sums in
$t$ and the sums of $\mathcal{I}_{12}$ and $\mathcal{I}_{21}$. Then, if $z \neq 0$ there exists a block containing both a positive cell
and a negative cell. Hence by the same argument as in the proof of Lemma 1, $\|z\|_1$ can be reduced
by a move of Type I.

Let us consider the case of $N \ge 3$. If there exists a diagonal block with $z(S_n) \neq 0$ for some $n$, $1 \le n \le N$,
$S_n$ contains both a positive cell and a negative cell. Then, by the same argument as in the proof of
Lemma 1, $\|z\|_1$ can be reduced by a move of Type I. In the case that $z(S_n) = 0$, $n = 1, \dots, N$,
by the same argument as in the proof of Lemma 2, $\|z\|_1$ can be reduced by a move of Type II.

The uniqueness of the minimal Markov basis follows, since the basic moves and the moves
of Type II in $\mathcal{M}_{S_1,\dots,S_N}$ are indispensable. □

In Theorem 3 in Section 4 and Proposition 1 we assumed $r_{N+1} = R + 1$ and $c_{N+1} = C + 1$ in
the definition of subtables. In fact, the set of moves of Types I-IV forms a Markov basis for the
configurations arising from more general block diagonal effect models. Let $1 = r_1 < r_2 < \cdots <
r_{N+1} \le R + 1$, $1 = c_1 < c_2 < \cdots < c_{N+1} \le C + 1$ and $S_n = \{(i,j) \mid r_n \le i < r_{n+1},\ c_n \le j < c_{n+1}\}$.
Denote $\mathcal{S} = \{S_1, \dots, S_N\}$. Let $T_q$, $q = 1, \dots, Q$, be the subtables of $\mathcal{I}$ of the form $T_q = \bigcup_{n \in \mathcal{N}_q} S_n$
where $\mathcal{N}_q$ is a subset of $\{1, 2, \dots, N\}$ with $\mathcal{N}_q \cap \mathcal{N}_{q'} = \emptyset$ for $1 \le q < q' \le Q$. Then the general
block diagonal effect model is defined by

$$\log p_{ij} = \mu + \alpha_i + \beta_j + \sum_{q=1}^{Q} \gamma_q I_{T_q}(i,j). \tag{4}$$

The sufficient statistic of the model is summarized as

$$(x_{1+}, \dots, x_{R+}, x_{+1}, \dots, x_{+C}, x_{T_1}, \dots, x_{T_Q})'.$$

By the same argument as in the proofs of Theorem 3 and Proposition 1, we have the following
corollary.

Corollary 1. The set of moves of Types I-IV in $\mathcal{M}_{T_1,\dots,T_Q}$ forms a Markov basis for the
configuration $A_{T_1,\dots,T_Q}$.

6 Numerical experiments 

In this section we apply the MCMC method with the Markov bases derived in the previous 
sections for performing conditional tests of some data sets. 

The first example is Table 1, which shows the relationship between school and clothing for
1725 children. This data is from Gilby and Pearson [?]. Each row represents a primary school
of the usual county-council type. The rows are arranged in ascending order of the wealth of
the children. The children are also classified by their clothing and the columns are arranged
in ascending order of the neatness of their clothing.

Table 1: Relationship between school and clothing.

              I      II     III    IV & V   Total
   No. 1      86     49     10       1       146
   No. 2     102    116     24       3       245
   No. 3      25     19      2       0        46
   No. 4     137     98     33       4       272
   No. 5     209    222     73      16       520
   No. 6      65    154     71      27       317
   No. 7       9     33      1       1        44
   No. 8       3     60     51      21       135
   Total     636    751    265      73      1725

We set the model (1) with two subtables
$S_1 = \{(i,1) \mid 1 \le i \le 3\}$, $S_2 = \{(i,j) \mid 1 \le i \le 5,\ 1 \le j \le 2\}$ as a null hypothesis. Starting from
the observed data in Table 1 we run a Markov chain of 100,000 tables including 10,000 burn-in
steps and compute the chi-square statistic for each sampled table. The histogram of chi-square
statistics is shown in Figure 7. In the figure the black line shows the asymptotic distribution
$\chi^2_{19}$. Since the observed data is large enough, the estimated exact distribution is close to $\chi^2_{19}$.
For the observed data in Table 1 the value of the chi-square statistic is 154 and the approximate
p-value is essentially zero. Therefore the change point model (1) is rejected at the significance
level of 5%.




Figure 7: A histogram of chi-square statistic (horizontal axis: chi-square statistic).
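The sampling step of such an experiment can be sketched as follows. This is a minimal Python illustration rather than the authors' implementation: it assumes the standard Metropolis-Hastings scheme on a fiber with the hypergeometric target $p(x) \propto 1/\prod_{i,j} x_{ij}!$, reads the index ranges of $S_1$ and $S_2$ as inclusive, and uses hypothetical function names. The computation of the chi-square statistic, which needs fitted expected frequencies, is omitted.

```python
import random
from itertools import combinations

TABLE1 = [
    [86, 49, 10, 1],
    [102, 116, 24, 3],
    [25, 19, 2, 0],
    [137, 98, 33, 4],
    [209, 222, 73, 16],
    [65, 154, 71, 27],
    [9, 33, 1, 1],
    [3, 60, 51, 21],
]
R, C = len(TABLE1), len(TABLE1[0])
# Subtables of the change point model, 1-based and inclusive (assumed reading).
S1 = {(i, 1) for i in range(1, 4)}
S2 = {(i, j) for i in range(1, 6) for j in range(1, 3)}
SUBTABLES = [S1, S2]

def basic_moves(R, C, subtables):
    """Signed basic moves that keep every subtable sum fixed."""
    moves = []
    for i, i2 in combinations(range(1, R + 1), 2):
        for j, j2 in combinations(range(1, C + 1), 2):
            z = {(i, j): 1, (i2, j2): 1, (i, j2): -1, (i2, j): -1}
            if all(sum(v for cell, v in z.items() if cell in S) == 0 for S in subtables):
                moves.append(z)
                moves.append({cell: -v for cell, v in z.items()})
    return moves

def suff_stat(x):
    """Row sums, column sums and subtable sums of a table stored as a dict."""
    rows = [sum(x[i, j] for j in range(1, C + 1)) for i in range(1, R + 1)]
    cols = [sum(x[i, j] for i in range(1, R + 1)) for j in range(1, C + 1)]
    subs = [sum(x[cell] for cell in S) for S in SUBTABLES]
    return rows + cols + subs

def mh_step(x, moves, rng):
    """One Metropolis-Hastings step (in place) targeting p(x) proportional to 1 / prod x_ij!."""
    z = rng.choice(moves)
    ratio = 1.0
    for cell, v in z.items():
        if v < 0:
            if x[cell] == 0:
                return                    # proposal would leave the fiber: stay at x
            ratio *= x[cell]              # x! / (x - 1)! = x
        else:
            ratio /= x[cell] + 1          # x! / (x + 1)! = 1 / (x + 1)
    if rng.random() < ratio:
        for cell, v in z.items():
            x[cell] += v

rng = random.Random(0)
x = {(i + 1, j + 1): TABLE1[i][j] for i in range(R) for j in range(C)}
t_obs = suff_stat(x)
moves = basic_moves(R, C, SUBTABLES)
for _ in range(2000):
    mh_step(x, moves, rng)
```

Every visited table has the same row sums, column sums and subtable sums as the observed one. By Theorem 1 the basic moves used here connect the fiber, which is what makes the chain irreducible.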

The second example is Table 2, which shows the relationship between birthday and deathday
for 82 descendants of Queen Victoria. This data is from Diaconis and Sturmfels [?]. Let
$r_i = c_i = 1 + 3(i - 1)$ for $i = 1, \dots, 5$, permitting the replication modulo 12. We test the common
block diagonal effect model (2) against the block diagonal model (3) with several parameters.

Table 2: Relationship between birthday and deathday.

            Jan  Feb  March  April  May  June  July  Aug  Sep  Oct  Nov  Dec
   Jan       1    0     0      0     1     2     0    0    1    0    1    0
   Feb       1    0     0      1     0     0     0    0    0    1    0    2
   March     1    0     0      0     2     1     0    0    0    0    0    1
   April     3    0     2      0     0     0     1    0    1    3    1    1
   May       2    1     1      1     1     1     1    1    1    1    1    0
   June      2    0     0      0     1     0     0    0    0    0    0    0
   July      2    0     2      1     0     0     0    0    1    1    1    2
   Aug       0    0     0      3     0     0     1    0    0    1    0    2
   Sep       0    0     0      1     1     0     0    0    0    0    1    0
   Oct       1    1     0      2     0     0     1    0    0    1    1    0
   Nov       0    1     1      1     2     0     0    2    0    1    1    0
   Dec       0    1     1      0     0     0     1    0    0    0    0    0

Starting from the observed data in Table 2 we run a Markov chain of 1,000,000 tables including
100,000 burn-in steps and compute (twice) the log-likelihood ratio statistic

$$2 \sum_{(i,j) \in \mathcal{I}} x_{ij} \log \frac{\hat{m}^{(3)}_{ij}}{\hat{m}^{(2)}_{ij}}$$

for each sampled table $x = \{x_{ij}\}$, where $\hat{m}^{(2)}_{ij}$ and $\hat{m}^{(3)}_{ij}$ denote the expected cell frequencies under
the models (2) and (3), respectively. The histogram of log-likelihood ratio statistics is shown in Figure
8. In the figure the black line shows the asymptotic distribution $\chi^2_3$. Because of the sparsity of Table
2, the estimated exact distribution differs from the asymptotic distribution $\chi^2_3$. For the
observed data in Table 2 the value of the log-likelihood ratio statistic is 3.07 and the approximate
p-value is 0.43. Therefore the common block diagonal effect model (2) is accepted at the significance
level of 5%.



Figure 8: A histogram of log-likelihood ratio statistic (horizontal axis: log-likelihood ratio statistic).
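The text does not say how the expected cell frequencies are computed. One common choice, used here purely as an assumption, is iterative proportional fitting (IPF), which rescales the fitted values until their sums over the rows, the columns and the relevant sets of cells match the observed sums. The Python sketch below applies it to a hypothetical $3 \times 3$ table with $1 \times 1$ diagonal blocks and evaluates twice the log-likelihood ratio of the model with block-wise parameters against the model with a common parameter.

```python
import math

def ipf(x, partitions, sweeps=500):
    """Iterative proportional fitting.

    x maps cells to observed counts; each partition is a list of disjoint
    lists of cells covering the table. Assumes every part has a positive
    observed sum, otherwise a division by zero can occur.
    """
    m = {cell: 1.0 for cell in x}
    for _ in range(sweeps):
        for partition in partitions:
            for part in partition:
                obs = sum(x[c] for c in part)
                fit = sum(m[c] for c in part)
                for c in part:
                    m[c] *= obs / fit
    return m

# Hypothetical 3 x 3 table with 1 x 1 diagonal blocks (N = 3).
x = {(0, 0): 10, (0, 1): 3, (0, 2): 4,
     (1, 0): 2, (1, 1): 12, (1, 2): 5,
     (2, 0): 6, (2, 1): 1, (2, 2): 8}
cells = list(x)
rows = [[c for c in cells if c[0] == r] for r in range(3)]
cols = [[c for c in cells if c[1] == k] for k in range(3)]
diag = [c for c in cells if c[0] == c[1]]
off = [c for c in cells if c[0] != c[1]]

m2 = ipf(x, [rows, cols, [diag, off]])                    # common diagonal parameter
m3 = ipf(x, [rows, cols, [[c] for c in diag] + [off]])    # one parameter per diagonal block
lr = 2 * sum(x[c] * math.log(m3[c] / m2[c]) for c in cells)
```

In an MCMC run this statistic would be recomputed for every sampled table, so a faster fitting routine may be preferable in practice. The fixed number of sweeps is a simplification, and a convergence check on the fitted margins would be more robust.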
7 Concluding remarks

In this paper we derived explicit forms of the Markov bases for some statistical models with
block-wise subtable effects and performed conditional tests with some real data sets. For
the change point model we also discussed the algebraic properties of the configuration arising
from the model. Hara et al. [?] gave a necessary and sufficient condition on a single subtable
under which the set of basic moves forms a Markov basis. It is of interest to find a necessary and
sufficient condition on several subtables $S_1, \dots, S_N$ under which the set of basic moves forms a Markov basis.